Learn R Programming

pchc (version 1.2)

Variable selection for continuous data using the FBED algorithm: Variable selection for continuous data using the FBED algorithm

Description

Variable selection for continuous data using the FBED algorithm.

Usage

cor.fbed(y, x, ystand = TRUE, xstand = TRUE, alpha = 0.05, K = 0)

Value

A list including:

runtime

The duration of the process.

res

A matrix with the index of the selected variable, their test statistic value and the associated p-value.

info

A matrix with two columns. The cumulative number of variables selected and the number of tests for each value of K.

Arguments

y

The response variable, a numeric vector.

x

A matrix with the data, where the rows denote the samples and the columns are the variables.

ystand

If this is TRUE the response variable is centered. The mean is subtracted from every value.

xstand

If this is TRUE the independent variables are standardised.

alpha

The significance level, set to 0.05 by default.

K

The number of times to repeat the process. The default value is 0.

Author

Michail Tsagris.

R implementation and documentation: Michail Tsagris mtsagris@uoc.gr.

Details

FBED stands for Forward Backward with Earcly Dropping. It is a variation of the classical forward selection, where at each step, only the statistically significant variables carry on. The rest are dropped. The process stops when no other variables can be selected. If K = 1, the process is repeated testing sequentially again all those that have not been selected. If K > 1, then this is repeated.

In the end, the backward selection is performed to remove any falsely included variables. This backward phase has not been implemented yet.

References

Borboudakis G. and Tsamardinos I. (2019). Forward-backward selection with early dropping. Journal of Machine Learning Research, 20(8): 1-39.

See Also

pc.sel, mmpc, cortest, correls

Examples

Run this code
x <- matrix( rnorm(50 * 50), ncol = 50 )
y <- rnorm(50)
a <- pchc::cor.fbed(y, x)
a

Run the code above in your browser using DataLab